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ABSTRACT 

We identify compact groups of galaxies (CGs) within mock galaxy catalogues from 
the Millennium Simulation at z=0 with the semi-analytic models (SAMs) of galaxy 
formation of Bower ct al., Croton ct al. and De Lucia & Blaizot. CGs arc identified 
using the same 2D criteria as those visually applied by Hickson (1982) to his CGs 
(HCGs), but with a brightest galaxy magnitude limit, and we also add the important 
effect of observers blending close projected pairs. Half of the mock CGs identified in 
projection contain at least 4 accordant velocities (mvCGs), versus 70% for HCGs. In 
comparison to mvCGs, the HCGs are only 8% complete at distances < 9000 km s -1 , 
missing the CGs with small angular sizes, a strongly dominant galaxy, and (for the 
second SAM) the mvCGs that are fainter and those with lower surface brightness. 
10% of the mock mvCGs are identical to the parent virialized group, meaning that 
they are isolated, while the remainder are embedded in their parent virialized groups. 
We explore different ways to determine the fraction of physically dense groups given 
the data from the simulations. Binding energy criteria turn out to be inapplicable 
given the segregation between galaxies and dark matter particles. We rely instead on 
the combination of the three-dimensional length of the CGs (maximum real space 
galaxy separation) and their elongation along the line-of-sight (ratio of maximum 
line-of-sight to maximum projected separations), restricting ourselves in both cases to 
smallest quartets within the CGs. We find that between 64% and 80% (depending on 
the SAM) of the mvCGs have 3D lengths shorter than 200 hr 1 kpc, between 71% and 
80% have line-of-sight elongations less than 2, while between 59% and 76% have cither 
3D lengths shorter than 100/i _1 kpc or both lengths shorter than 200 /i -1 kpc and 
elongations smaller than 2. Therefore, chance alignments (CAs) of galaxies concern at 
most 40% of the mvCGs. These CAs are mostly produced from larger host groups, but 
a few have galaxies extending a few Mpc beyond the host group. The mvCGs built with 
the Hickson selection (respectively without the close projected pair blending criterion) 
have 10% higher (lower) fractions of physically dense systems. 

Key words: galaxies: clusters: general - methods: data analysis 



1 INTRODUCTION 

Compact Groups (CGs) are small, relatively isolated sys- 
tems of typically four or five luminous galaxies in close 
proximity to one another. T he first example of a CG 
was found by IStephanl (fl877h. Several catalogues of CGs 
are now available: iRosel (119771 ) and iHicksonl (| 1982h visu- 
ally identified CGs on POSS I photographic plates. Af- 
ter the Hickson compact group (HCG) catalogue, several 
CG catalogues have been automatically extracted from 
galaxy catalogues, themselves automatically extracted from 



photographic plat es: from the COSMOS/UKST Southern 



Galaxy Ca talog (|Prandoni. Iovino. fc MacGillivrav 
Iovinchooj). from the D POSS catalogue (|lovino et al.l 



1994 



2003 



de Carvalho et al1l2005l ) or CCD frames from the Sloan Dig 



ital S ky Survey (SDSS) photometric catalogue (|Lee et al.l 
l2004h . CG catalogues have also been extracte d from galaxy 
catal ogues in redshift s pace: from the CfA2 (Barton et al.l 
1 19961) . Las Campanas (I Allam fc Tucked 120001 ) . and SPSS 
(|Deng et allbOOTtlMcConnachie et al.ll2009f) surveys, as well 



as from the 3D UZC Galaxy Catalog (jFocardi fc Kelml 
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I2002I ). CGs are so compact that the me dian projected galax y 
separation in HCGs is only 39 h -1 kpc (|Hickson et al.l 19921 ). 

HCGs have been studied in detail, in particu- 
lar their internal structures, shapes, morphologies, lumi- 
nosit i es, and environments ([Hicksonl 1982j ; Hickson et al. 
19841 ; IMamonl Il986l; I Hickson et alj Il988l; IHickson fc Rood 
19881: iMendes de Qliveira fc Hicksonl Il99ll: IZepJ Il993l: 



Moles et alj 119941: iPrandoni et alj 11994 
20041 : iTovmassian et alj|2006h . Tosummarise. these studies 



Kelm fc Focardil 



indicate that CG galaxies have star formation properties, 
colours and morphological mixes that lie in between binary 
galaxies and isolated ones. 

The nature of the CGs has been a puzzling 
matter for quite some time. How can a few bright 
galaxies coexist within less than 100 kpc, given that 
galaxies are expected to merge f ast i n such s y stems 
dCarnevali. Cavaliere. fc Santangelol 198ll : I Barnes! 1 19851 : 
iMamonll 19871 : iBode. Cohn. fc Luggerll 19931 )? There are three 
schools of thought on this matter. One view is that compact 
groups are recently formed de nse systems that are ab out to 
coalesce into a single galaxy (|Hickson fc Roodlll988f ), The 
galaxies lost in the merger ma y be replenished by galaxies 
in th e loose group environment (|Diaferio. Geller. fc Ramellal 
1 19941 ). and the predicted rate of formation of CGs ap- 
pears to b e sufficient to explain the observed frequency 
of HCGs (|Mamonl |200Ch . The second view states that 
CGs may b e transient unbound cores of looser group s 
<|Rosel Il977l ; iRamella et al. I I1994J : ITovmassian et al.ll200j ). 
And the third scenario places CGs as chance align- 
ments of gal axies along the line of sight within larger 
loose groups (lRoselll977f for CGs elonga ted in projection; 



Ma 



tcrs 



19861 and I Walks fc Mamonl 1 19891 in general), clus- 
Walke fc Mamonl Il989h~ and cosmological filaments 



|Hernquist. Katz. fc Weinberg! Il995h . In this scenario, the 
numerous signs of interaction and star formation is ex- 
plained by the frequent o ccurrence of b inaries and triplets 
in the chance alignments (|Mamonlll992l ). 

If CGs are physically dense, their dynamical times 
should be short (1% of the age of the Universe), and the 
hot intra-group gas should trace a smooth gravitational po- 
tential. The launch of X-ray observatories with good sen- 
sitivity in the soft X-ray band (ROSAT, ASCA, Chan- 
dra and XMM-Newton) has led to the detection of hot X- 
ray emitting gas from many CGs. Since the X-ray emis- 
sivity scales as the square of the gas density, X-ray emis- 
sion is le ss prone to projection effects than optical surveys 
(but see lOstriker. Lubin. fc HernquistHl995T ). However, al- 
thoug h 22 HCGs were det ected out of 32 pointed observa- 
tions (|Ponman et al.lll996l ). it is not clear what the global 
fraction of detections would be on the full sample of HCGs 
(69 [92] gr oups with at least 4 [3] accordant velocities, ac- 
cording to lHickson et al.1ll992f ). Moreover, some of the de- 
tected groups appear clum py (e.g. HCG 16 according to 
iDos Santos fc Mamonl [19991 ). which strongly suggests their 
unvirialized state. 

The distinction between compact groups that are dense 
in 3D, or chance alignments within loose groups or longer 
filaments is difficult, because redshift space distortion intro- 
duces uncertainties in the computation of the line of sight 
coordinate which might result in misidentified compact con- 
figurations. For a group with line of sight velocity disper- 
sion er„, the redshift distortion will amount to a spread 



of 8r z = <j v /Ho in the line of sight coordinate. Assuming 
that the square velocity dispersion is half the square cir- 
cular velocity at the virial radius, a 2 = (1/2) GM(r v )/r v 
(appropriate for an p oc 1/r 2 density profile), one finds 
Sr z /r v = (1/2) \/T00 = 5 if the virial radius is defined where 
the mean density at that radius is 100 times the critical 
density of the Universe. So redshift space distortions pre- 
vent measuring distances within virial systems (see also the 
Introduction of lWalke fc Mamonl [l989l ). 

Nevertheless, there is o ne CG meeting the HCG crite- 
ria discovered by one of us <|Mamonl Il989 ) that is so close 
(within the Virgo cluster) tha t surface brightn ess fluctua- 
tion distance measurements by Mei et al. ([20071 ) are able to 
settle the issue of its nature: IMamonl \ 20081 ) concludes that 
this CG is a chance alignment of galaxies along the line of 
sight, at least 440 kpc and most probably 2 Mpc long. 

In summary, even though many efforts have been de- 
voted to look for an explanation about the nature of CGs, 
the debate is still wide open. 

The advent of increasingly realistic cosmological sim- 
ulations now allow one to distinguish whether CGs are 
truly dense in 3D, or caused by chance alignments within 
looser groups, filamentary str uctures, or the general field. In 
an ear ly pioneering attempt, iHernquist. Katz. fc Weinberg! 
jl995h . who identified galaxies as dense knots of cold gas in 
their iV-body + hydrodynamical simulation, and searched 
for CGs in redshift space in many viewing directions. They 
found four CGs with at least 4 accordant velocities, all of 
which were longer than 2/i _1 Mpc along the line of sight 
(one was as long as 4 ft -1 Mpc), and yet presented accor- 
dant velocities, despite the (Hubble law) stretching of veloc- 
ities cause d by their elongat ion along the line of sight. The 
analysis of IHernquist et all suffers from several drawbacks 
(according to present-day standards for cosmological simu- 
lations): the simulation box was small (44/i _1 Mpc wide), 
the mass resolution was poor (their simulation had 32 3 dark 
matter particles and 32 3 gas particles, and their galaxies 
were identified with as few as 8 gas particles), and the spa- 
tial resolution was poor (the dark matter particles had a 
softening length of 10/i _1 kpc). Furthermore, the identifi- 
cation of galaxies with knots of dense gas was not optimal, 
especially that feedback from supernovae and active galactic 
nuclei were not incorporated. 

In this work, we quantify the fraction of CGs that can 
be considered as physically dense entities in samples of au- 
tomatically identified CGs, based upon more realistic cos- 
mological N body simulations. At present, one can build 
realistic CGs in two ways: 1) from dissipationless cosmolog- 
ical simulations on top of which galaxies are painted using 
fairly complex semi-analytical galaxy formation/evolution 
models; 2) from hydrodynamical codes that resolve galaxies. 
We have chosen the first approach and use for this purpose 
the largest cosmological N body simulation ever performed 
(in 2 006, when the presen t study began), the Millennium 
Run l|Springel et al.ll2005h . on which galaxies were identi- 
fied in three ways, using three different state-of- the-art semi- 
analytic models (SAMs) of galaxy formation ([Bower et all 
12009 ; ICroton et al"1l2006l ; |Pe Lucia fc Blaizotl 120071 ). 

These three galaxy samples provide an opportunity to 
test both, projection effects and the real nature of systems 
identifie d using standard algorithms like that proposed by 
IHickson! (| 1982T ) . The CGs are identified in mock redshift- 
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space catalogues constructed from the real-space galaxy 
sample derived with the semi-analytical model, from the 
Millennium Run. 

In comparison with the analysis of lHernquist et all our 
study is based upon a simulation in a box whose volume is 
over 1 million times greater, with 30 thousand times as many 
particles, 25 times finer mass resolution and a softening scale 
4 times smaller. However, the simulation we use does not 
contain gas particles, so the galaxy parameters are highly 
dependent on the physics of galaxy formation and evolution 
of the three SAMs that we analyse. 

We focus here on the HCG catalogue, which is by far 
the best studied sample of Compact Groups. 

The layout of this paper is as follows. In Section 
we present the different steps for the construction of the 
mock CG catalogue and Sect. [3] describes how the resulting 
CGs are classified. The conclusions are summarised and dis- 
cussed in Section [4] Once our analysis was we l l adva nced, 
we learnt about the work of lMcConnachie et al.l(j2008l ), who 
performed a si milar analysis of the proper ties of Hickson-likc 
CGs from the lDe Lucia fc Blaizotl (|2007t ) galaxy catalogue, 
and found that 70% of the mock CGs selected in projection 
were caused by chance alignments of galaxies. We highlight 
in Sect. l4.2l the similarities and several important differences 
between our two studies. 



2 CONSTRUCTION AND CLASSIFICATION 
OF THE COMPACT GROUP SAMPLE 

2.1 Observed compact group sample 

We use the HCG cata logue of compact groups , with 
photometry measured bv lHickson. Kindl. fc Aumanl d 1989 ) 
in the B and (presumably J ohnson) R ba n ds. | Hicksonl 
1 19821 ) found 100 HCGs, and Iffickson et all (|l992l ). who 
measured the redshifts for virtually all galaxies, built 
a velocity sample of vHCGs by eliminating galaxies ly- 
ing at more than 1000 km s -1 from the group's median 
velocity. In this manner, they obtained 92 HCGs with 
at least 3 accordant velocities and 69 HCGs with at 
least 4 accordant velocities. We extracted the photometry 
and velocities using Table VII/213/ galaxies in VizieRij 
(|Ochsenbein. Bauer, fc Marcoutll2000l ). This database con- 
tains the velocities for all galaxies except 6. We found the 
redshifts for these 6 galaxies (Tabled} in the NASA/IPAC 
Extragalactic Database 

(NED 

For future comparisons with the SDSS, we choose a 
Johnson _R-band magnitude limit of 17.44 that mimics the 
SDSS spectroscopic magnitude limit of r < 17.77 (see ap- 
pendix [S} . We measure the total raw magnitude Rt and 
the extinction-corrected magnitude R T using 

R T ~ B T — (B — R) , 
R° T ~ B T ~(B-R) + E B -r , 

A R /Av 



Eb-r 



f Bt — Bn 



Ab/Av _ 

where Bt, B t and B — R are the raw total blue magnitude, 



1 http:/ /webviz.u-strasbg.fr/viz-bin/VizicR 

2 http://nedwww.ipac.caltech.edu 



Table 1. Additional HCG galaxy redshifts found in NED 
Reference 



Falco et al. (1999) 
de Carvalho et al. (19971 
de Carvalho et al. (19971 
i d£\^mcmfieurs_et al. (1991) 
Hickson (19931 ; Barton et al. (19981 
Hickson (19931 



Galaxy 


V 


«(«) 




(kms" 1 ) 


(km s~ 


18b 


4105 


25 


19c 


4253 


23 


19d 


20443 


26 


51g 


7532 


11 


57h 


9240 


105 


lOOd 


5590 


32 



Notes: The radial heliocentric velocities and their errors are 
jiven in columns 2 and 3, respectively. For HCG57h, the velocity 
is from the first reference, while the error is from the second. 



extinction-corrected total blue magnitude and isophotal B — 
R colour, all given in VizieR. 

We first note that only 83 HCGs out of the original 
993 have at least 4 galaxies whose extinction-corrected R- 
band magnitudes are within 3 mag from the brightest oneQ 
Six o f the HCGs d o not satisfy the HCG isolation crite- 
rion (|Sulentidll997l ) and these were also omitted from our 
sample. On the other hand, we re-inserted into our sam - 
ple HCG 31, which has additional members |Sulentidll987l ). 
among which two additional members within 3 magnitudes 
from the brightest member: gala xies G and Q. for w hich we 
adopt the J?-band photometry of lRubin et al.l (Il990l). while 
the ra dial velocities are taken from lMendes de Oliveira et al] 
|2006l) . and convert to R T using the median difference for 
other galaxies: R T = R — 0.38. 

We are left with 72 HCGs whose brightest magnitude 
satisfies Rt, < 17.44 — 3 = 14.44, thus ensuring completeness 
out to R = 17.44. We call this the pHCG sample (for HCGs 
defined in projected space). 

Only 52 among the 72 pHC'Gs have at least 4 galaxies 
within 1000 km s -1 from the median group velocity (here- 
after the vHCG sample, for velocity-selected HCG). 



2.2 Basic scheme for mock compact group 
samples 

Our mock catalogues of CGs are built in several steps, in 
which we: 

(i) simulate the gravitational evolution of a large piece 
of the Universe, represented by collisionless (dark matter) 
particles; 

(ii) attach galaxies to the simulation with a semi- 
analytical galaxy formation model 

(iii) convert to a mock galaxy catalogue in redshift space; 

(iv) convert to a mock 2D CG catalogue (hereafter mpCG 
for mock CG in projection), by applying the HCG selection 
criteria; 

(v) convert the mpCG catalogue to a velocity-filtered 



3 We have omitted group HCG 54, which is the HCG with the 
smallest projected radius, as it appears to be eithe r a group of H II 
regions in a single galaxy {Xrkhipova ct al. 198 J) or the end re- 
sult o f the merger of two disk galaxies jVerdes^Montenegro et al] 
1200211 . 

4 This is also clea r in the Bt and B^, magnitudes given in 
iHickson et al.|[l989l . althoug h this was not d iscussed by these au- 
thors, but was also noted bv lSulentid Jl997f) . 
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mock CG catalogue (hereafter, mvCG for mock velocity- 
filtered CG), by removing galaxies with discordant redshifts; 

(vi) convert the mvCG catalogue to a mock velocity- 
filtered HCG catalogue (hereafter, mvHCG for mock 
velocity-accordant Hickson Compact Group), by randomly 
selecting groups according to the completeness of the HCG 
as a function of group surface brightness, brightest galaxy 
magnitude and its contribution to the total group luminos- 
ity. 

The last step is motivated by the strong incompleteness of 
the HCG catalogue in surface magnitude and galaxy mag- 
nitude (see Sect. 12.71 below). 

A list of the different acronyms used to refer to the 
different samples is provided in Tabled 



2.3 Dark matter particle simulation 

We use the Millennium Simu lation, w hich is a cosmologi- 
cal Tree-Particle-Me sh (TPM, |Xu|[l995|) iV-body simulation 
jSpringel et alj|2005f ). which evolves 10 billion (2160 3 ) dark 
matter particles in a 500 h~ x Mpc periodic box, using a co- 
moving softening length of 5 h - 1 kpc0 The cosmological pa- 
rameters of this simulation correspond to a flat cosmological 
model with a non- vanishing cosmological constant (ACDM): 
fi m = 0.25, fi A = 0.75, a 8 = 0.9 and h = 0.73. The simula- 
tion was started at z = 127, with the particles initially posi- 
tioned by displacing particles initially in a glass- like distribu- 
tion according to the ACDM primordial density fluctuation 
power spectrum. The 10 9 particles of mass 8.6 x 10 8 ft _1 M o 
are then advanced with the TPM code, using 11000 inter- 
nal time-steps, on a 512-processor supercomputer. The po- 
sitions and velocities of the 10 billion particles were saved 
at 64 epochs (leading to nearly 20 TB of data). 



2.4 Modelling galaxies 

We consider the z = outputs from three di fferent SAMs 
of gal axy formation by Bower et al. (|2006t ). ICroton et al.l 
(|2006h . and lDe Lucia fc BlaizotJ (|2007l V (B06, C06 and DLB, 
respectively), where each model was applied in turn to the 
outputs of the Millennium Simulation described above. Note 
that, while the B06 and C06 models were developed inde- 
pendently, the DLB model is essentially the same as the C06 
model, except that the merger rate is reduced by a factor 2, 
the magnitudes are derived using spectral sy nthesis models 
based up on a different i nitial mass function (|Chabrierll2003l 
instead of lSalpeterlll955l ) with fewer low mass stars, and the 
treatment of radiative transfer to dust is much more refined. 

The three SAMs produce galaxy positions, velocities, 
as well as absolute magnitudes (in five or more optical and 
near-infrared wavebands, all including Johnson R), as well 
as other quantities. To summarise, the branches of the halo 
merger tree (produced by the Millennium Simulation) are 
followed forward in time, and the following astrophysical 
processes are applied: gas infall and cooling, early reheat- 
ing of the intergalactic medium by photoionization, star 



formation, black hole growth, AGN and supernova feed- 
back, galaxy mergers, spectro-photometric evolution, etc. 
The model parameters have been adjusted to produce a good 
match to the observed properties of local galaxies. In these 
SAMs, AGN feedback is responsible for the absence of cool- 
ing flows in rich clusters, for the cut-off at the bright end 
of the galaxy luminosity function and for the number den- 
sity properties of the most massive galaxies at all redshifts. 
Also, the early reheating of the IGM by photoionization is 
responsible for suppressing gas cooling in halos below a cir- 
cular velocity that is independent of redshift (or nearly so). 
The 3 SAMs produce z — galaxy luminosity functions that 
are in good agreement with observations in both the bj and 
K wavebands (f| with an excess of galaxies at very bright 
luminosities for all 3 models and a slight excess at faint lu- 
minosities for the C06 and DLB models. Moreover, the B06 
SAM provides several other observational predictions: the 
bj and K galaxy luminosity functions at higher redshifts, 
the global history of star formation, and the local black hole 
mass vs. bulge mass relation. 

All three SAMs produce around 10 million galaxies at 
z = 0. The galaxy samples appear to be complete at least to 
AIr — 5 log h < —17.4 with stellar masses 
(C06) or M* > 3 x 10 8 h^ 1 M e (B06, DLB). 

Each of the three SAMs has its strengths and weak- 
nesses. The B06 model computes galaxy mergers by infer- 
ring the positions of galaxies in their halo through typical 
values of their energies and angular momentum in units di- 
mensioned to the virial scales of the halos. In contrast, the 
C06 and DLB models have the advantage of estimating the 
merger rates directly from the positions of subhaloes in the 
dark matter simulation. They both use the same analyti- 
cal formula for the orbital decay time by dynamical friction 
once the subhalo masses fall below their resolution limit, 
where the DLB time is twice the C06 time, which itself 
match es almost perfectly the decay time that Ijiang et al.l 
(|2008h calibrated on high-resolution cosmological hydrody- 
namical simulations. Unfortunately, C06 do not provide the 
galaxy merger trees, so it is difficult to derive the history of 
star formation of a given galaxy. Also, while B06 find a Red 
Sequence with increasing red colours for increasingly higher 
stellar masses, the C06 catalogue shows a colour-luminosity 
relation for the Red Sequence galaxies that flattens at high 
luminosity, contrary to observations, and a s imilar effect is 
seen in the DLB galaxy output (as shown bv lBertone et all 
l2007h . Still, the B06 colours are too blue and fit somewhat 
less well the SPSS colour distribution th an do the DLB 
colour s (|Mateus et alJl200S( ). The SAM of ICattaneo et al.1 
(|2006h reproduces better the colours of galaxies, but its out- 
put is not public and the galaxy positions are determined 
stochastically (like B06) rather than by following the dark 
matter subhaloes (like C06 and DLB). The DLB catalogue 
produces galaxies whose present-day small-scale segregation 
of recently formed stellar mass is too large, w hile that of B0 6 
matches well that observed with the SDSS (|Mateus et all ). 
This is surprising given that the B06 model treats galaxy 
mergers using stochastic positions rather than the positions 
of the subhaloes with which the galaxies are associated (see 



5 The Millennium Simulation, run by the 
Virgo Consortium, is publicly available at 
http: / / www. mpa-garching. mpg.de / millennium 



The z = lumi nosity function of the 



model is given bv lBertone et al 



De Lucia fc Blaizotl (2007) 
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Table 2. List of acronyms used throughout this work 



CG 



general compact groups 



HCG Hickson compact groups 

pHCG HCGs that strictly meet the Hickson (1982) criteria +R b s£ 14.44 
vHCG velocity accordant pHCGs 



pmpCG particle mock projected compact groups, which strictly meet the Hickson (1982) criteria +Rt ^ 14.44 

mpCG observable mock projected compact groups (same as pmpCGs, but accounting for galaxy confusion) 

pmvCG particle mock velocity accordant compact groups 

mvCG observable mock velocity accordant compact groups (same as pmvCGs, but accounting for galaxy confusion) 

mvHCG observable mock velocity accordant compact groups with Hickson's biases 



CA chance alignment of galaxies 

CALG chance alignment of galaxies within looser groups 
CAF chance alignment of galaxies within filaments 



PG3D Parent groups identified in real space 



above). However, the DLB model predicts a little better 
than B06 the analogous s egregation for intermediate age 
(0.2 — 0.5 Gyr) stellar mass l)Mateus et al] ). But the present- 
day galaxy merger rate of DLB appears too low, while that 
of B06 matc hes well the o bservations of the frequency of 
galaxy pairs (|Mateusll20"oi ). 

In summary, it is very difficult to decide which of the 
three SAMs is most appropriate for our study of CGs, and 
we therefore decided to analyse the outputs of all three of 
them. We will find and illustrate several important differ- 
ences in the properties of mock CGs predicted from these 
three models. 

2.5 Mock galaxy catalogues 

Using the snapshots at z = 0, we construct mock catalogues 
in redshift space. For the three SAMs, we obtain redshifts by 
adding the Hubble flow to the peculiar velocities projected in 
the line of sight direction. We compute the observer-frame 
galaxy apparent magnitudes from the rest-frame absolute 
magnitudes provided by the semi-analytical model. These 
apparent magnitudes are convert ed to the obser ver frame 
using tabulated k + e corrections (|Poggiantilll997T l. 

Our mock catalogue is constructed by viewing the full 
volume of the simulation box from one of its 8 vertexes 
(z min ~ 0.17, 7r/2sr = 5156 deg 2 ) We set an apparent mag- 
nitude limit R = 17.44, equal to the limit we set on the HCG 
groups to match the SDSS spectroscopic catalogue for later 
comparisons (see Sect. 12, I'll , 

In order to increase the statistical significance of our re- 
sults, we considered eight observers situated at the eight ver- 
texes of the simulation box, and all the identification proce- 
dures were performed on these eight samples to finally com- 
bine the resulting CGs into one larger sample. These eight 
samples are almost fully statistically independent, since 3/4 
of the mock CGs selected in the cone (see Fig. [3| he within 
half the box size (256 /i -1 Mpc) (see upper right panel of 
Fig- [21) ■ Table summarises the main properties of the three 
mock galaxy catalogues seen from one of its vertexes. 

The completeness of our magnitude+volume limited 
mock catalogues might be an important issue that could 
bias the results. The implications on the results of using 
magnitude+volume limited samples will be carefully tested 
in Sect. 

We identify regular groups of galaxies in the simulation 
box by applying a Friends-of-Friends (FoF) algorithm in real 



Table 3. Mock galaxy catalogues (R < 17.44) 



Mock 


# 


^mcd 


"90 








(h 3 Mpc" 3 ) 


Bower et al. (2006) 


556 224 


0.0998 


0.062 


Croton et al. (2006) 


446 153 


0.0969 


0.059 


De Lucia & Blaizot (2007) 


1034 619 


0.1114 


0.089 



Notes: #: number of galaxies seen from a single vertex of the 
simulation box, z mc( j: median redshift, ngo- space number density 
within 90 h^ 1 Mpc, defined in equation JTJ. 



space (|Davis et al.lll985l ) to the galaxies. We adopt a linking 
length of I = 0.17 rC x l' A . where n is the mean space density 
of galaxies. The factor 0.17 roughly corresponds to an over- 
density of 100 relative to the critical density of the Universe, 
roughly the minimum overdensity (hence maximum radius) 
where cosmological struc tures are in dynamic al equilibrium 
jBrvan fc Normanl fl99^ . but recent work by ICuesta et al.l 
|200S| shows that on the mass scales of groups, the radius of 
equilibrium is roughly 30% greater) . We denote these groups 
the PG3Ds for Parent groups selected in real space, and will 
later check if the mock CGs extend beyond these PG3Ds. 

2.6 Mock compact groups selected in projected 
space 

In this work, we use an automat ed mpCG searc h algorithm 
very similar to that described bv lHicksonl |l982l ). applied to 
the three mock galaxy catalogues. The algorithm defines as 
mpCGs those systems that satisfy the following conditions: 

(i) 4 s; N ^ 10 (population) 

(ii) )ir < 26 mag arcsec -2 (compactness) 

(iii) 6m > 3#g (isolation) 

(iv) brightest 14.44 (flux limit) 

where 

• N is the total number of galaxies whose 7?-band mag- 
nitude satisfies R < -Rbrightest + 3, where -Rbrightcst is the 
magnitude of the brightest galaxy; 

• fj,R is the mean i?-band surface magnitude, averaged 
over the smallest circle circumscribing the galaxy centres. 

• 0g is the angular diameter of this smallest circum- 
scribed circle; 

• 6 m is the angular diameter of the largest concentric 
circle that contains no other galaxies within this magnitude 
range or brighter; 
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Figure 1. Flowchart of the selection of projected compact groups. 
All magnitudes must be brighter than our chosen global magni- 
tude limit. The dashed portion allows for compact groups con- 
taining isolated compact subgroups. 



Note that the fourth criterio n (which impl ies -Rfaintcst ^ 
17.44) was not considered by iHicksonl j 19821 ). This restric- 
tion is fundamental for avoiding selection biases, as will be 
demonstrated in Sect. 12.81 

The main steps of this algorithm are summarised in the 
flowchart of Figure [T] 

Now, some CGs meeting Hickson's criteria might be 
embedded within larger CGs that also meet Hickson's crite- 
ria (with larger isolation annuli). For such groups, we thus 
have two choices for our CG selection algorithm: select the 
smaller (sub-)group (solid portion only in the flowchart of 
Figurefl]) or the larger group (with the dashed portion of the 
flowchart of Figure [T]). The percentages of CGs containing 
smaller CGs are 13%, 10% and 6% for B06, C06 and DLB 
models, respectively. The HCG sample was selected accord- 
ing to the larger group (P. Hickson, private communication) . 
However of the 100 groups in the original HCG sample, only 
one has a definite subgroup (HCG 17). Therefore, it is not 
clear that P. Hickson always followed the larger group algo- 
rithm. In what follows, we adopt the larger group algorithm 
(i.e. including dashed portion of the flowchart of Figure [T]). 
However, our results turn out to depend little on the choice 
among these two algorithms. 



To accelerate this algorithm, we have used the subrou- 
tines of the HEALPbjj] package to find neighbours, and the 
STRIPACK0 subroutines to compute the centres and radii 
of the minimum circles. Given that our mock catalogues 
have edges (the limits of the cone), we discarded CGs ly- 
ing near the edges since those groups will be fictitiously iso- 
lated. Then, we kept with a safe sample of CGs that lies in 
the range a > 5° & a < 85°, and 5 > 5° & S < 85° (solid 
angle Afi = 1.2693 sr). 

Using this algorithm, we find 7580, 4756 and 15383 
mock CGs in the B06, C06 and DLB samples, respectively. 

Now, the galaxies in the mock galaxy catalogues are 
simply point particles. However, when one observes two 
galaxies that lie so close in projection on the plane of the sky 
that their isophotes overlap, they risk being blended into a 
single object. This galaxy confusion can be important for 
CGs, which by definition often have overlapping isophotes. 
For example, observed CG catalogues should have fewer very 
dense groups than mock CG catalogues. We therefore in- 
cluded one extra observability criterion: two galaxies are 
confused and blended if their projected separation is smaller 
than the sum of their half light radii, in which case we sum 
their luminosities and adopt the redshift of the most lumi- 
nous galaxy. For galaxies in the mock catalogues, the half 
light radii were computed as a functio n of their a b solute 
magnitude in the R-band, according to IShen et al.l (|2003l . 
eqs. [14] and [15] therein). 

Hereafter (see Table [2]), we refer to these observable 
mock projected compact groups as mpCGs and denote the 
original ones as pmpCGs (for particle-mock-projected com- 
pact groups). The mpCGs are built with the Hickson criteria 
given at the beginning of Sect. 12. 61 and thus contain at least 
4 galaxies (after the pair-blending procedure). We under- 
stand that our pair-blending criterion is simplistic and may 
be somewhat liberal in defining confused galaxy pairs. In re- 
ality, observed CGs should lie in between our pmpCGs and 
our mpCGs, but probably much closer to the mpCGs. We 
therefore adopt the observable criterion (hence, the mpCGs) 
in what follows, unless explicitly stated otherwise. 



2.7 Mock compact groups after velocity filtering 

We then built a sample of velocity-filtered mock compact 
groups on top of the respective pmpCG and mpCG samples, 
which we call the pmvCG {particle-mock-velocity-filtered 
compact group) and mvCG (mock velocity-filtered compact 
group) sample s (see Table \2 1 wit h the following iterative 
procedure (see iHickson et al.l 19921 ) : 



• Compute the median velocity of the group, tWdian- 

• Discard those galaxies with | u — w mc dian | > 1000 kms -1 . 

• If at least n m i n galaxies remain, iterate until no galaxies 
are dropped or the group disappears (n < n m i n ), 

• Save those CGs that have at least n m i n galaxies and 
that satisfy the compactness criterion. 

We call n the number of accordant-velocity galaxies in the 
mvCG and adopt n m i n = 4 as our minimum number of ac- 
cordant velocities. 



7 http:/ /hcalpix. jpl.nasa.gov/index.shtml 

8 http: / /people. sc. fsu.edu/~burkardt/f_src/stripack/stripack. html 
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Table 4. Compact group samples 



Sample Type gals 


V 




JV 






"90 








filter 








(io- 


5 h 3 Mpc- 3 ) 


pHCG obs 


ext 


no 




72 






1.9 




vHCG obs 


ext 


yes 




52 






1.1 










B06 


C06 


DLB 


B06 


C06 


DLB 


pmpCG mock 


par 


no 


7580 


4756 


15383 


49 


33 


31 


mpCG mock 


ext 


no 


3574 


3265 


4729 


18 


23 


11 


pmvCG mock 


par 


yes 


4553 


2685 


5646 


13 


29 


23 


mvCG mock 


ext 


yes 


2073 


2095 


2825 


16 


22 


10 


mvHCG mock 


ext 


yes 


272 


223 


291 


5.1 


5.4 


2.6 



Notes: Col. 1: sample; col. 2: sample type (obs=observed); col. 
3: galaxy type (par=particle; ext=extended); col. 4: velocity fil- 
ter; col(s). 5: number of groups (summed over 8 vertexes for 
mocks); col(s). 6: space density within 90 /i~ 1 Mpc (eq. [I], di- 
vided by 8 for the mock samples to take into account the 8 ver- 
texes from which t hey were selec ted) . C olumns with 3 values show 



the results for the 



Bower et aD (B06), ICroton et all (C06), and 



|Pe Lu cia & Blaizot (DLB) SAMs, respectively. The limiting sur- 



face magnitude of the mvHCG sample is not sharp. 

Table [4] shows the number of groups in the observed 
and mock CG samples. The percentage of mpCGs that sur- 
vive the velocity-filtering is 58% (B06), 64% (C06), and 60% 
(DLB), so that our final samples of accordant velocity CGs 
contain from ~ 2050 to ~ 2800 mvCGs depending on the 
adopted SAM. 

The main properties of the mvCGs identified in the 
DLB galaxy catalogue are shown in Figure [2] together with 
the observed distribution of vHCGs (the distribution of the 
mvCGs obtained with the other two SAMs are similar, ex- 
cept for radial velocity distributions that are more skewed 
to lower values and considerably more groups in the bin of 
lowest group surface brightness). Hickson's visual selection 
of CGs produced a catalogue that is incomplete at small 
angular sizes (middle left plot), faint brightest galaxy mag- 
nitudes (middle right plot), and in groups with a dominant 
brightest galaxy (bottom left plot). We will quantify the 
completeness of the HCG in Sect. 12.91 



2.8 Testing the volume-limited sample 

As we shall now see, limiting the depth of our galaxy sam- 
ple to the simulation box provides a complete list of mvCG 
candidates. However, our neglect of galaxies further than 
the box size may prevent distant galaxies from spoiling the 
isolation of some of the mvCGs. Although the DLB model 
is available in an observing cone, this is not the case for 
the other two SAMs, and while we can construct a cone 
ourselves by placing galaxies of previous time-steps at the 
position corresponding to their lookba ck times, we do not 
have access to the z > outputs of the ICroton et alj (|2006l ) 
model to do this, and we wish to consider all three SAMs in 
parallel. 

We therefore use the DLB model to build a mock sample 
of galaxies within a cone, built of shells constructed from dif- 
ferent snapshots corresponding to the epoch of the lookback 
time at their distance. Here we use the 17 last snapshots, 
bringing us to a maximum redshift of z = 0.68, where the 
minimum luminosity, Mr = —24.91, corresponds to 45 L*. 

The top panel in Fig. [3] compares the radial velocity 
distribution of mpCGs extracted from the volume limited 
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Figure 2. Distributions of properties of the velocity selected 
CGs: mock mvCGs from the De Lucia & Blaizot (2007) model 
{thin red histograms) and observed vHCGs (thick black dashed 
histograms). The properties are the group multiplicities (top left 
panel), radial velocities (top right panel), angular group diameters 
(middle left panel), brightest galaxy magnitudes (middle right 
panel), magnitude differences between the brightest galaxy and 
the full group (bottom left panel), and group surface brightness 
(bottom right panel). Error bars correspond to Poisson errors. 



mock catalogue described in Sect. 12.51 (solid lines) seen from 
one of its vertexes with that obtained from groups identi- 
fied in a magnitude limited mock catalogue or light cone of 
the same solid angle (dashed lines). It can be clearly seen 
that the mpCGs obtained from the light cone and from the 
volume limited catalogue before the i?bri g htcst cut-off (thin 
lines) are quite different. First, the cone sample is able to de- 
tect a large number of mpCGs beyond the limits of the box. 
On the other hand, fewer mpCGs are identified in the cone 
sample at small distances (up to 80% of the box size). This 
lower abundance of mpCGs in the cone sample is the conse- 
quence of distant galaxies spoiling the isolation criterion of 
many mock compact groups. The total number of mpCGs 
in the cone sample is 4% lower than in the box sample. 
These differences become more pronounced when the flux 
limit is applied (thick lines). The number of mpCGs in the 
cone sample is now 38% lower than that of the box sample. 
Interestingly, the cone sample of mvCGs (with the bright- 
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Figure 3. Radial velocity distributions of mpCGs (top panel) 
and mvCGs (bottom panel) built from a volume-limited catalogue 
(solid lines) and from a light cone (dashed lines), both having 
R < 17.44. These catalogues were constructed from the DLB 
SAM. 



est galaxy magnitude limit applied) shows a lack of CGs at 
all distances in comparison with the analogous box sample. 
The total number of mvCGs in the cone sample is half that 
of the analogous box sample. This indicates that our mpCG 
box-sample catalogue is only 62% reliable against contami- 
nation of the isolation annulus by distant interlopers, while 
our mvCG box-sample catalogue is only 50% reliable. 

We can use this comparison of box and cone samples 
to correct the fraction of mpCGs that survive the veloc- 
ity filter and make it as mvCGs: this fraction becomes 
0.6 x 0.5/0.62 = 48% ± 1 (where the error is from bino- 
mial statistics and neglects the systematic error from the 
cone to box correction). 



2.9 Completeness 

2. 9. 1 Measure of completeness 

It is interesting to compare the space density of mpCGs and 
mvCGs with those of the observed HCGs, selected in the 
same way. We estimate for the mock and observed samples 
the mean total surface density of CGs, as well as the mean 
space density of CGs within the lowest median distance of 
all samples, which is a fairly robust measure of density. The 
adopted distance is 9000 kms~ , which is close to the median 
of the vHCG sample, so the space density is computed as 



3JV (v < 9000 kms -1 ) , 

903 A» ^MPC- 3 



(1) 



where v is the median velocity of the group members, while 
A is the solid angle of the sample (in sr). 



2.9.2 Projected compact groups 

For the mpCGs, we obtain 1.8, 2.3 and 1.1 x 10~ 4 h 3 Mpc~ 3 
(see Tableg]) using the SAMs by B06, C06 and DLB, respec- 
tively. In comparison, the HCGs were selected on the POSS I 
plates, spanning 9.7 sr = 32 000deg 2 (Dec > -33°). For the 
72 pHCGs, the mean density is n 90 = 1.9 x 10" 5 h 3 Mpc -3 
(Table |4]), i.e. typically 9 times lower than the values ob- 
tained from the 3 samples of mpCGs. 

Now, within a limiting distance of v = 9000 km s -1 , we 
found (Fig. [3} 17 mpCGs in our light cone in comparison 
with 22 in one of our boxes, again because our box sample 
misses possible distant interlopers that spoil the CG iso- 
lation. This suggests that we would have found 23% fewer 
mpCGs, had we not limited ourselves to the box. We deduce 
that the observed pHCG sample is (l/9)/(17/22) ~ 14% 
complete at this limiting distance (which again corresponds 
to the median distance of the HCG catalogue). 

Hickson's inclusion of the Galactic Plane should lead to 
underestimates of the completeness of roughly 1/3, which 
is the fraction of his search area (S > —27° covered by the 
POSS I survey) with low galactic latitudes |6| < 20°. There- 
fore, the bulk of the incompleteness of the HCGs lies in the 
incomplete visual selection at high galactic latitudes. 



2.9.3 Velocity-filtered compact groups 

We now compare the space density of mvCGs with that of 
the vHCG sample. For our mock samples of mvCGs with at 
least 4 accordant velocities, the space densities ngo are (Ta- 
ble [3J 1.6, 2.2 and 1.0 x 10~ 4 h 3 Mpc~ 3 , for B06, C06 and 
DLB, respectively. For comparison, for the 52 vHCGs (de- 
fined with at least 4 accordant velocities and with brightest 
galaxy magnitude brighter than 14.44), the space density is 
n 90 = 1.1 x 10 _5 /i 3 Mpc~ 3 (Table SJl- Therefore, the space 
density of mvCGs selected in the box is typically 15 times 
that of the observed vHCG. 

However, within v < 9000 km s -1 , we found 16 mvCGs 
in our light cone versus 20 (20% more) in our box (for a 
single vertex as observation point). This suggests that we 
would have found 20% fewer mvCGs, had we not limited 
ourselves to the box (thus allowing for distant galaxies to 
spoil the isolation of these 20% of the mvCGs). Therefore, 
we deduce that the completeness of the vHCG sample is 
1/15/(16/20) = 8%. Note that we assumed that the con- 
tamination of distant galaxies of the isolation criterion of 
mpCGs and mvCGs is independent of the SAM, even if we 
only measured this effect with the DLB model. 

The top panels of Fig. [4] show the completeness of 
the velocity-filtered Hickson sample as a function of ra- 
dial velocity for the 3 SAMs. The completeness is defined 
as C(v) = <f£°/Kj 1 B°, where Ho = 100 km s- 1 Mpc" 1 . 
The green arrow shows the limit of our nearby subsample 
(see Table Q. The next 5 rows of panels show the com- 
pleteness within the nearby subsamples (v < 9000 km s -1 ) 
C = rigo CG /noo vCG , as a function of the other observable 
properties for the nearby subsample. 

For all 3 SAMs, the vHCG completeness decreases 
sharply with distance (top panel), for groups with a domi- 
nant brightest galaxy (bottom panel), while the C06 model 
also predicts a decrease of vHCG completeness at fainter 
magnitude and lower surface brightness, whereas these 
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Figure 4. Completeness of the vHCG catalogue relative to the 
pmvCG (left) and mvCG (right) mock catalogues as a function 
of distance (top panels) and for groups within v = 9000 kms — 1 
(other five rows of panels). From top to bottom the panels repre- 
sent global cumulative completeness vs. distance, vs. group total 
magnitude, vs. group surface brightness and vs. brightest group 
galaxy, and differential completeness vs. angular size and vs. dom- 
inance of brightest galaxy. The reference SAMs are Bower et 
al. (blue circles), Croton et al. (red triangles), and De Lucia & 
Blaizot (black crosses). Error bars for differential completeness 
are computed as binomial errors and are shown only for DLB 
SAM. 



trends are weaker in B06 and absent in DLB. Comparing 
the left and right sets of panels, one sees similar qualita- 
tive trends of completeness versus interesting parameter, but 
with overall completeness relative to the extended-galaxy 
mvCGs that is roughly double the value of the complete- 
ness relative to particle-based pmvCGs. Moreover, one can 
see a slightly faster trend of decreasing completeness with 
decreasing surface brightness with the C06 model. 

2.9.4 Comparison with previous studies 

Previous studies have concluded that the pHCG sample is 
incomplete at low group s u rface brightn ess (24 < < 
26 ma g arcsec" 2 ): IHicksonl (|l982T l and IWalke fc Mamonl 
(1 19891) (from the lack of low surfac e brigh tness groups), and 
iPrandoni. IovinoT fc MacGillivravl (|l994T ) (from a compari- 
son with their own automatically-selected projected SCG 
sample of com pact groups, built with very similar criteria as 
lHickson|[l982l ). We also found a large incompleteness at low 
group surface brightness when comparing the pHCGs with 
our mpCG samples. However, the sharp drop in the number 
of compact groups with low surface brightness (hr > 24) 
observed in the pHCGs, is also clearly visible in the mvCGs 
with the DLB model (but less so with the other two SAMs). 

The incompleteness in brightest galaxy counts is anal- 
ogous to the incompleteness in group num ber counts that 
iHicksonl had noticed at R — 13.0 and that IPrandoni et alj 
had already noticed at the much brighter limit of bj — 13.1 
(from the break in the slope of the number c ounts away from 
the Euclidean value of 0.6). iMamonl (|2000l ) noted that Fig. 
7 of IPrandoni et all indicates that the pHCG catalogue is 
incomplete by a factor 3 at bright magnitudes, relative to 
the SCG catalogue, while this incompleteness gets worse at 
increasingly fainter magnitudes. A closer look at their Fig. 
7 reveals that the number of groups brighter than bj — 13 is 
roughly 30 for the HCG and 2.5 for the Euclidean extrapola- 
tion of the SCG group counts to this relatively bright mag- 
nitude. Given that the solid angle of the HCG (32 000deg 2 ) 
is 25 times that of the SCG (1300 deg 2 ), the completeness 
of the HCG relative to the SCG is 30/2.5/25 = 0.48, with 
total surface densities of 30/32 000 = 0.9 x 10~ 3 deg" 2 and 
2.5/1300 = 1.9 x 10" 3 deg" 2 for the pHCG and SCG, re- 
spectively. 

This strong incompleteness at faint magnitudes is also 
evident for vHCGs, as seen in the middle right panel of our 
Fig. [3 which suggests (by matching the magnitude counts 
at intermediate magnitudes) that the HCG becomes incom- 
plete for brightest galaxy magnitudes fainter than R = 12.5, 
to the point where at R = 14.44, the differential complete- 
ness falls to roughly 5%. The surface densities of the mpCGs 
limited to magnitude brighter than R < 11.6 (roughly corre- 
sponding to bj = 13) are 7.5, 6.6, 11.8 x 10" 3 deg" 2 , for B06, 
C06 and DLB, respectively, typically 10 times that the sur- 
face density of the pHCGs. However, once we limit groups 
to the nearby subsample, the strong incompleteness at faint 
magnitudes appears barely visible (B06 and C06) or reversed 
(DLB), as seen in Figure [J] 

The incompleteness of the pHCG in do minant brightest 
galaxy groups had already b een notic e d by IPrandoni et al.l 
(1994J), who also found that IHicksonl (|1982T ) was biased in 
favour of groups where the two brightest galaxies have com- 
parable magnitudes. 
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2.10 Mock Hickson compact groups 

As noted above, the HCGs produced by Hickson's visual 
inspection cannot be reproduced by an automatic searching 
algorithm given the many biases in the selection of HCGs. 
Therefore, the nature and properties of the mock CGs that 
strictly meet the HCG criteria mentioned in Sect. 12.61 may 
be different from the properties of the HCGs themselves. 

Given the strong and progressive incompleteness of the 
HCGs in brightest galaxy counts, small angular sizes, and 
systems with strongly dominant brightest galaxies, it is es- 
sential to fold in these extra factors of incompleteness when 
building a sample that will be a good mock for the observed 
HCGs. We therefore wish to construct a mock velocity- 
selected Hickson Compact Group (mvHCG) sample, starting 
with the mvCG sample, and selecting galaxies with proba- 
bilities proportional to the completeness in 1) group surface 
brightness, 2) brightest galaxy magnitude, and 3) difference 
between the brightest galaxy and total group magnitude (i.e. 
the relative importance of the brightest galaxy). We do not 
consider the distribution of angular sizes, since this latter 
quantity is directly dependent on the three other parame- 
ters. 

Because the resulting number of mvHCGs turns out to 
be very small, rather than select mvCGs according to the 
probability that a given mvCG would be observed by Hick- 
son, we proceeded as follows. We selected the first mvCGs 
that fill the observed distribution of vHCGs for the three pa- 
rameters and stopped once one of the 10 bins in any of the 
three distributions for the mvCGs reaches the value observed 
in the corresponding bin for the vHCGs. Hence, the derived 
distributions of the three parameters do not match perfectly 
the observed ones, but are lower limits. We repeated this 
exercise, using different orders for our loop over the mvCGs 
until we matched as best as possible the observed vHCG 
distributions. This procedure was again applied on the eight 
different samples corresponding to the eight observers situ- 
ated at the eight vertexes of the simulation cube obtaining 
final samples of typically 250 mvHCGs. 

The distribution of properties of mvHCGs, shown in 
Figure [5] for the DLB model, matches much better the ob- 
served distributions of vHCGs. Similar results are found us- 
ing the other two SAMs. The three SAMs fail to find groups 
of roughly concordant magnitudes. 

The three mvHCG samples will be used to compare 
with the properties of observed vHCGs and with the corre- 
lations obtained for previous authors based on the observed 
accordant-velocity HCGs. 



3 DIFFERENT CLASSES OF COMPACT 
GROUPS 

Even though we have used redshift information to identify 
our mvCGs, the selected groups are not necessarily physi- 
cally dense in 3D real space. 

Our mvCGs can thus be split into three classes: 

• Physically dense groups (Real), 

• Chance alignments within loose groups (CALG), 

• Chance alignments within filaments (CAF). 

Keeping with the original intent of lHicksonl (| 19821 '). who had 
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Figure 5. Same as Figure [2] for the mvHCGs from the 
|Pe Lucia &: Blaizoj SAM (thin red histograms) and (again) the 
observed vHCGs (thick black dashed histograms). 

selected in projection compact groups of at least four galax- 
ies, we will classify an ravCG as Real if at least 4 of its 
galaxies form a physically dense group. Also, we will some- 
times join the CALG and CAF classes into the set of Chance 
Alignments (CAs). 

There are several ways to use the three-dimensional in- 
formation to define these classes, and as we shall see, none 
of them are perfect. 

3.1 Binding energies 

A simple way to separate the Real CGs from the CAs is to 
use the binding energy of the system. In appendix [Bl we 
show that binding energies are highly inaccurate for groups 
of masses M < 10 14 Mq, and cannot be used to distin- 
guish which CGs are physically dense and which are caused 
by chance alignments. 

3.2 Line-of-sight shape and 3D length 

Alternatively, we can classify the mock CGs using their size 
and/or elongation. We consider the 4 closest galaxie s in each 
mock CG, again in line with the original intent of lHicksonl 
1 19821 ). By closest 4 galaxies, we mean either the entire 
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Figure 6. Line-of-sight elongation vs. maximum 3D separation 
of mock velocity-filtered compact groups extracted from the De 
Lucia & Blaizot (2007) model. The red dashes delimit the Real 
mvCGs with the hybrid classification discussed below. 

mvCG if it has only 4 galaxies, or else the subgroup of 4 
with the smallest 3D length. We use the following notations 
for these smallest quartets: 

• s: maximum 3D separation, hereafter 3D length; 

• S±: maximum projected separation, hereafter projected 
size; 

• S»: maximum line-of-sight separation, hereafter line-of- 
sight length; 

• S\\/S±: hereafter, line-of-sight elongation; 

• round mvCG: S\\/S± < 2; 

• elongated mvCG: Su/S± ^ 2. 

Figure [6] shows how the line-of-sight elongation is re- 
lated to the maximum 3D separation. The points in the 
lower left part of Figure [6] show uncorrelated line-of-sight 
elongation and 3D length, as expected for Real groups, while 
the upper right part of the figure shows instead a strong cor- 
relation of line-of-sight elongation with 3D length, indicative 
of CA groups. Which cuts in line-of-sight elongation and 3D 
length separate best the Real CGs from the CAs? The choice 
of the critical 3D length, s cut , is not straightforward, as we 
shall now discuss. 

3.2.1 Matching line-of -sight elongation of real- space 
selected groups 

We first tried varying s cu t by imposing that the median line- 
of-sight elongation be equal to that of real-space-selected 
groups. We measured a median line-of-sight elongation of 
0.725 for the PG3Ds. We also checked this median value of 
the line-of-sight elongation with Monte-Carlo simulations of 
quartets distributed at random in a virial sphere with an 
NFW density profile with concentration r v /r s = 10, where 
we then elongated the sphere in two orthogonal directions 



Figure 7. Upper panels: Median line-of-sight elongation vs. crit- 
ical 3D length, s C ut, for the 4 galaxies in the richest subclump 
of mock velocity-accordant compact groups. Dashed (solid) lines 
refer to values of s cu t where the sample of mvCGs has less (more) 
than 50 mvCGs. The red solid horizontal lines correspond to the 
median line-of-sight elongation measured by selecting 4 galaxies 
at random from each PG3D group. Lower panels: Normalised cu- 
mulative distribution of 3D lengths, considered as critical separa- 
tions between the Real and CA classes. The green arrows indicate 
the value of s cu t that matches the median line-of-sight elongations 
of the PG3D groups (vertical arrows) yielding the corresponding 
fraction of Real CGs for the adopted s cu t (horizontal arrows). 

by two factors to make it a triaxial ellipsoid, and observed 
it from a random direction, and repeated this exercise 5000 
times. We then find median line-of-sight elongations of 0.782 
(in spheres) and 0.719 (in tr iaxial ellipsoids, wit h b/a = 0.79 
and c/a — 0.65, as found bv ljing fc Sutol ()2002l ). on average, 
in ACDM halos at overdensity 100). This median line-of- 
sight elongation for triaxial halos is very close to what we 
measured for the PG3Ds. 

We also considered the cores of virialized groups, where 
the overdensity is 10 , close to how overdense HCGs appear 
to be. Here we, limited the particles to a radius of 0.025 virial 
radii, i.e. 0.25 scale radii (with our concentration of 10), 
where the mean density is roughly 1000 times greater than at 
the virial radius. Noting that A CDM halos a re less spherical 
at overdensities as high as 10 5 l|jing fc Sutol find b/a = 0.61 
and c/a — 0.46, on average), we consider these less spherical 
halos and then find a median line-of-sight elongation of 0.722 
(even closer to the median elongation of the PG3Ds). In 
general, the median line-of-sight elongation is much more 
sensitive to the triaxiality of the object than to the slope of 
its density profile. 

As seen in the top panels of Figure [7] the values of s cut 
required for the shortest mvCG to reproduce the median 
line-of-sight elongation of the real-space selected PG3Ds are 
fairly small and vary from SAM to SAM, from ~ 50 h~ x kpc 
for DLB to ~ 80/i _1 kpc for B06, with C06 in between. 
The bottom panels of Figure [7] indicate that the fraction of 
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Figure 8. Projected size versus line-of-sight length for the small- 
est quartets within mvCGs, using the De Lucia & Blaizot (2007) 
SAM. Also shown are log-spaced contours (red), the medians in 
bins of 200 points (green jagged line) the interquartiles (magenta 
jagged lines), and the line y = x (thin blue line). 

mvCGs whose 3D length of their smallest subclump of four 
galaxies is less than the corresponding s cut varies strongly 
with the SAM: 40% with B06, 20% with C06, but only 4% 
with DLB. This should not lead us to conclude that most 
mvCGs are CAs, because the minimum 3D length (s cut ) for 
CAs with the DLB model is only half the median projected 
size of the DLB mvCGs, which does not seem reasonable. 
In other words, it is not reasonable to force redshift-space 
selected groups to be as round as real-space selected ones: 
redshift-space selection will always produce somewhat more 
elongated groups than real-space selected ones. 

3.2.2 Line-of-sight elongation versus line-of-sight size 

Alternatively, one could argue that CAs should be long in 
the absolute, i.e. high s, and/or relative to their projected 
sizes, i.e. high S\\/Sj_, e.g. S\\/S± > 2. 

If the projected sizes were independent of the line-of- 
sight lengths, as would be expected if all CGs were CAs, 
we could then impose a value of s cu t that would be close to 
\/2 times the upper envelope of S± (since round CGs would 

have s ~ ^Js 2 ± + Sf ~ V2S±). 

Figure [8] shows that the mvCGs behave differently: 
while at high line-of-sight length, where CAs are expected 
to be dominant, the projected size is indeed independent 
of the line-of-sight length, at low line-of-sight length, where 
CAs are not dominant, the projected size increases with in- 
creasing line-of-sight length. So the upper envelope of the 
projected sizes is not a clear-cut value. The contours sug- 
gest a close to linear increase of S± with Su in the low S\\ 
regime, as expected for systems of same line-of-sight elonga- 
tions and different sizes. 



The transition between these two regimes is difficult 
to ascertain. One way is to look for the value of S\\ for 
which the median S± changes from a high to low slope. 
This yields a critical Sit of « 140 kpc for the DLB model 
(see Fig. U), 120 h' 1 kpc for the B06 model and 165 h' 1 kpc 
for the C06 model. These 3 critical values of S« correspond 
to S± ~ 100,105 and 112 h' 1 kpc, for the B06, C06 and 
DLB models, respectively. One therefore infers critical group 
length of s cut = 156, 196 and 179 h' 1 kpc, for the B06, C06 
and DLB models, respectively. 

The fraction of mvCGs with lengths smaller than these 
three values of s cu t can then be read from the bottom panel 
of Figure |5] one finds 72%, 72% and 59% of the groups have 
s < s cut for the B06, C06 and DLB models, respectively. 
One would therefore deduce that between half and three- 
quarters of the mvCGs are Real (depending on the SAM). 
However, given the crudeness of the method, one should take 
these percentages with caution. 



3.2.3 Reasonable cuts in length and line-of-sight 
elongation 

We now explore whether reasonable limits on s cut and 
S\\/S± can reduce substantially the fraction of Real mvCGs. 
Figure [9] displays the distributions of the 3D length, s, for 
the DLB model. 

The distribution of 3D lengths clearly shows a dom- 
inant log-normal component and a second component of 
more extended lengths. In fact, when restricting to round 
groups (S\\/S± < 2), the distributions of 3D lengths ap- 
pear very close to lognormal. Moreover, the round mvCGs 
tend to be smaller with the Bower model ((s) = 78 h -1 kpc) 
and larger with the DLB model ((s) = 125 /i -1 kpc), with 
the predictions from the Croton model in between. Fi- 
nally, the distribution of elongated mvCGs (Sn/S± ^ 2) 
is wider than that of the round mvCGs, and centred around 
s ~ 250 h' 1 kpc for the B06 and C06 models and 300 h' 1 kpc 
for the DLB model. It displays an extended tail of very large 
(> 2/i _1 Mpc) 3D lengths. 

The discussion above suggests a conservative maximum 
for the Real mvCGs of s cut < 200 h~ kpc. 

3.3 Fraction of CAs in different samples of mock 
CGs 

Table [5] summarises the fractions of CGs satisfying var- 
ious criteria that could classify them as Real. With our 
choice of s cut = 200 ft -1 kpc, we obtain fractions of Real 
mvCGs of 0.80 (Bower), 0.73 (Croton) and 0.64 (DLB). 
To be more favourable to the CAs, we can include addi- 
tional elongated mvCGs. However, it makes no sense to call 
a CA an elongated mvCG with a very small 3D length, for 
example with s = 50 h~~ x kpc, because such an mvCG is 
also a physically dense group, hence a Real. So, we con- 
sider a simple hybrid classification (dashed lines in Fig. [6]), 
where the CAs are the mvCGs with s ^ 200 ft -1 kpc OR 
(s > 100 h' 1 kpc AND S| ( /S± > 2). We then find (Tabled 
that the fraction of Real mvCGs is 0.76 (Bower), 0.67 (Cro- 
ton), and 0.59 (DLB). We hereafter adopt this hybrid cri- 
terion to estimate the fraction of Real and CA compact 
groups. 
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Figure 9. Top: Distribution of 3D lengths for the DLB model for 
all mvCGs (solid histograms pushed up by 20% for clarity), the 
round mvCGs (S^/S± < 2, dashed histograms), and the elon- 
gated mvCGs (5||/Sx 2, dotted histograms). The thin solid 
blue curve shows the total distribution for the pmvCGs. Bottom: 
Cumulative distribution of 3D lengths of all mvCGs. 



Table 5. Fraction of Real mock CGs using different criteria 



Criterion 




B06 


C06 


DLB 


pmpCG 


s < 200 ft- 1 kpc 




0.46 


0.39 


0.22 


S\\/S x < 2 




0.43 


0.38 


0.23 


s < 100 h- 1 kpc OR 










(s < 200 ft" 1 kpc AND Sy/Sj 


<2) 


0.43 


0.36 


0.20 


rnpCG 


s < 200 h" 1 kpc 




0.46 


0.47 


0.38 


S\\/S x < 2 




0.46 


0.46 


0.43 


s < 100 ft- 1 kpc OR 










(s < 200 ft- 1 kpc AND Sy/Si 


<2) 


0.44 


0.43 


0.35 


pmvCG 


s < 200 k- 1 kpc 




0.77 


0.70 


0.59 


S\\/S± < 2 




0.71 


0.68 


0.61 


s < 100 ft" 1 kpc OR 










(s < 200 ft" 1 kpc AND Sy /Sj. 


<2) 


0.72 


0.64 


0.53 


mvCG 


s < 200 ft" 1 kpc 




0.80 


0.73 


0.64 


S\\/S± < 2 




0.80 


0.72 


0.71 


s < 100 ft- 1 kpc OR 










(s < 200 ft" 1 kpc AND Sy /S ± 


<2) 


0.76 


0.67 


0.59 


mvHCG 


s < 200 ft" 1 kpc 




0.91 


0.83 


0.76 


S\\/S± < 2 




0.84 


0.73 


0.76 


s < 100 ft" 1 kpc OR 










(s < 200 ft" 1 kpc AND Sy /S_l 


<2) 


0.86 


0.75 


0.70 



Table 6. Fraction of chance alignments that extend beyond their 
parent loose group (CAFs) 



Sample 


B06 


C06 


DLB 


pmvCG 


0.32 


0.30 


0.30 


mvCG 


0.24 


0.24 


0.17 


mvHCG 


0.16 


0.26 


0.26 



It therefore, appears that more than half of the mvCGs 
are physically dense, although there are important vari- 
ations betwe en the three galaxy formation models, with 
Bower et al. predict ing the most physically d ense mvCGs, 
De Lucia fc BlaizotJ predicting the least, and ICroton et al.l 
in between. 

If we only consider CGs identified in projection 
(mpCGs), the Real CGs represent between 35% and 47%, 
depending on the criteria and on the SAM (Table [S]). 

We note that the fraction of CAs diminishes in all 
three SAMs when going from the pmvCG to mvCG and 
to mvHCG samples, i.e. when first taking into account the 
extended nature of galaxies causing confusion, and then in 
in corporat i ng the biases we measured in the visual selection 
of lHicksonl [1 19821 ). Table[5]thus indicates that the fraction of 
CAs is 28-47% for the pmvCGs, but is reduced to 24-41% 
for the mvCGs and only 14-30% for the mvHCGs. 



3.4 Chance alignments within Loose Groups and 
beyond 

We now consider as CAs all mvCGs with 3D lengths s 
200 ft- 1 kpc, regardless of the line-of-sight elongation. The 
distinction between CALG and CAF is very simple, as we 



simply check whether all CALG members lie within a single 
PG3D (making it a CALG) or not (making it a CAF). 

Table [6]shows that large majority of the CAs (typically 
three-quarters) are CALGs, regardless of the SAM and the 
sample. Thus, alignments within filaments or with galax- 
ies in the field are much less likely than chance alignments 
within larger groups or than having physically dense groups. 



3.5 Isolated dense groups? 

The standard picture is that dense groups of galaxies 
are the cores of virialized looser groups. However, some 
HCGs appear extrem ely isolated (|Rood fc Williams 1989; 
iPalumbo et~aH Il995l ). The simulations that we have anal- 
ysed allow us to check whether dense groups can be isolated 
out to the virial radius. We have simply cross-identified the 
mvCGs with the real-space-selected PG3Ds. We then call a 
CG isolated if it constitutes the entire PG3D (in the magni- 
tude range determined by the brightest galaxy of the CG). 
We then find that only ~ 11% of the mvCGs constitute the 
entire PG3D in all the three SAMs. In the mvHCG samples, 
the fraction of isolated CGs are somewhat smaller (7% for 
B06, 6% for C06, and 8% for DLB). 
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4 DISCUSSION AND CONCLUSIONS 
4.1 Summary of results 

The aim of this paper is twofold: 1) predicting the nature 
of automatically identified CGs, and 2) predicting the na- 
ture of the well-studied but highly incomplete and biased 
HCGs. We identify CGs in three mock galaxy catalogues, 
constructed from the Millennium Simula tion at z = com- 
bined with three semi-analytical models jBower et al J 1 20061 ; 
ICroton et al"1l2006l ; |Pe Lucia fc Blaizotll2007l ). Several thou- 
sand mock CGs are identified using a two di mensiona l auto - 
mated algorithm similar to that applied bv lfficksonl (|l982h 
plus a restriction in the brightest galaxy magnitude. We also 
allowed for CGs that contain isolated and compact sub- 
groups and furthermore considered the important effects 
of extended galaxies causing confusion for close projected 
pairs. 

Our main results are: 

(i) Among the observable mpCGs, ~ 60% have at least 
4 galaxies within 1000 km s -1 of the median velocity of the 
group, regardless of the SAM. Since our study is carried out 
on a sample with a redshift cut-off (z ~ 0.17), we tested 
whether our results might be bias by this fact. We find 
that identifying in our (magnitude-l-) volume limited cata- 
logue produces 1.6 times more mvCGs than identifying on 
an only-magnitude limited catalogue, because our box sam- 
ple misses distant galaxies that spoil the isolation of mock 
Compact groups. Correcting for this effect, we deduce that 
the fraction of mpCGs that survive the velocity filter is 50%. 
In comparison, there are as many as 52 vHCGs among 72 
pHCGs, meaning that 71% of observed HCGs survive the 
velocity filter. Given binomial statistics the probability that 
we find as many as 52 given that 36 (half of 72) are expected 
is negligible (P = 0.0%). 

(ii) Comparing the space densities of the mvCGs and the 
vHCGs, we deduce that the HCG catalogue is only 8% com- 
plete. A comparison of the parameter distributions between 
the mvCGs and vHCGs indicates that the HCG is incom- 
plete in groups of small angular sizes, high fraction of light 
in the brightest galaxy, as well as (for the C06 model) faint 
brightest galaxy magnitudes and low surface brightness. 

(iii) We find that the velocity filtering of mpCGs does not 
necessarily imply that the resulting accordant-velocity CGs 
are physically dense. We tested different criteria to classify 
the accordant-velocity CGs according to the maximum 3D 
galaxy separations and the line of sight elongations. We find 
that, with the most conservative criterion, at least 3/5 of the 
mock accordant-velocity CGs are physically dense, although 
the precise fraction depends on the galaxy formation model 
used. 

(iv) The large majority of non-Real mock accordant- 
velocity CGs are caused by chance alignments within larger 
groups, rather than within larger regions such as large-scale 
filaments. 

(v) We find that the fraction of chance alignments de- 
creases from 28-47% for the particle-based pmvCGs to 24- 
41% for the sample with close pairs removed (mvCGs) 
to only 14-30% once we fold in the biases of the HCG 
(mv HCGs). This explains in part why s imulation stud- 
ies (|Mamonl Il986l ; IWalke fc Mamonl 1 19891 ; iHernquist et all 



Il995h predict more chance alignments than one infers from 
actual observations. 

Table [4] indicates that typically half of the groups are 
lost once we apply the criterion to blend close projected pairs 
of galaxies. While the discarded groups no longer satisfy the 
selection threshold of 4 galaxies within 3 magnitudes from 
the brightest, there are still many blended pairs within the 
groups that survived the blending criterion (in 1/3, 1/4, and 
1/8 of the mock CGs for B06, DLB, and C06, respectively, 
with similar fractions for mpCGs and mvCGs). Could this 
mean that a significant fraction of the pHCGs and vHCGs 
contain blended galaxies? 

A comparison of the lists of | Hicksonl lll982h ( selected 
from photographic plates) and lHickson et aL (|l989T ) (CCD- 
based) indicates that the latter found 10 HCGs with extra 
galaxies (24, 26 [+3], 27, 43, 51, 70, 72, 76[+2], 83 and 99), 
plus one with one galaxy less (HCG 40). So 10% of the orig- 
inal HCG groups contained blended galaxies as discovered 
with the better CCD photometry. It therefore appears that 
our fractions of 1/8 to 1/3 of mpCGs with blended pairs is 
high. But if the truth is in between our particle case and 
our extended case, then we should have of order 6-17% of 
groups with blended galaxies, which is in rough agreement 
with what we see in the HCGs. 

Now, three HCGs have been observed at much higher 
resolution with HST imaging. One shows no extra galaxies 
(HCG 87), while the other two show m ore interacting units 
than counted by iHickson et al.l (|l989T l: HCG 31 seems to 
have 7 galaxies and not just 4, while HCG 90 has 5 galax- 
ies and not 4. Binomial statistics suggest that, with 95% 
confidence, the view of 2 groups out of 3 with extra galax- 
ies implies that the fraction of such groups with blended 
pairs is between 14% and 86%. But HST is probably bi- 
ased towards dense interacting HCGs. Still, there could be 
interacting pairs showing ga laxies that have bee n blended 
even with the CCD images of lHickson et al.l (|l989l ). So, with 
the high resolution of the HST, the fraction of pHCGs and 
vHCGs with blended pairs may be considerably higher than 
10% and in agreement with the fractions found in the mock 
CGs. 

We can also estimate the number of HCGs that are 
physically dense groups of at least 4 galaxies. As discussed 
in Sect. 12.11 the HCG catalogue has 100 members, among 
which 99 are compact groups (since HCG 54 is a collection 
of H II regions) , of which only 83 actually fulfil the original 
magnitude concordance criterion (i?-band magnitude range 
less than 3). Among the 72 pHCGs whose brightest and 
faintest galaxies are brighter than R — 14.44 and R — 17.44, 
respectively, only 52 (72%) have at least 4 accordant veloci- 
ties, and among these, we expect roughly between 36 and 44 
HCGs that are physically dense groups of at least 4 galaxies. 
Extrapolating to the 68 accordant-velocity HCGs (including 
those with magnitude range greater than 3 mags) , we expect 
no more than ~ 58 physically dense HCGs with at least 4 
galaxies. 

In comparison. iMamonl l|l986h had predicted that 47 out 
of what he thought would be 78 accordant velocity HCGs 
are caused by chance alignments (60%), while the remain- 
ing 40% are physically dense (but he predicted that half of 
these dense groups were u nbound s ystem s). We are there- 
fore less pessimistic than IMamonl (1 19861 ) on the fraction 
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of chance alignments polluting the HCG catalogue, since 
chance alignments appear to represent between 14% and 
30% of the mvHCGs (we were not able to check what frac- 
tion of the Real ones are unbound: see Appendix [Bj . Part 
of this discrepancy is caused by Mamon's (1986) reliance on 
simulations without consideration of selection effects such 
as observers blending close projected pairs of galaxies. Still, 
the percentage of chance alignments in the particle mock 
velocity-filtered compact groups (pmvCGs) is only 28-47% 
(depending on the SAM). Nevertheless, given the wide range 
of chance alignments fractions among the three SAMs, one 
cannot rule out that a more realistic galaxy formation model 
would lead to as much as 60% of chance alignments. 

4.2 Comparison with McConnachie et al. 

iMcConnachie et al.1 (|200Sf ) (MEP) have published a study 
very similar to ours: they also extracted pmpCGs from the 
DLB model obtained from the Millennium dark matter sim- 
ulations. Their sample extended to r = 18, which corre- 
sponds to roughly one-quarter of a magnitude fainter than 
our limit of R < 17.44. Their other pmpCG criteria appear 
to be almost exa ctly the same as ours (following the criteria 
oflHick son 198 2]), although their algorithm works differently 
(|McConnachie et al.ll2009h . MEP found a total of over 15 000 
pmpCGs over 4-7rsr. 

Using precisel y the same input galaxy catalogue (from 
iBlaizot et aT]|2005l ) as MEP, we find 25 000 pmpCGs, so, our 
algorithm is nearly 1.6 times more efficient than MEP's in 
finding pmpCGs. Surprisingly, if we build a light cone as we 
did in Sect. 12.81 with apparent magnitude limit of R = 17.67 
(~ rsDss = 18), we ob tain a mock ga laxy catalogue that is 3 
times denser than the IBlaizot et al.l mock galaxy catalogue 
used by MEP. From our mock galaxy catalogue, we extract 
15 191 pmpCGs in 1.2693 sr, which means that, with the 
data used in this work, we are ~ 10 times more efficient 
than MEP in finding CGs, principally by differences in the 
parent samples of galaxies, but also by differences in the CG 
detection algorithm. 

MEP and us agree that a significant fraction of mpCGs 
are caused by chance alignments: MEP found 71% of their 
pmpCGs are CAs while we find 80% (with our hybrid clas- 
sification, see Table [SJ. 

There are, however, several important differences be- 
tween our two studies: 

• MEP build their sample from a mock that extends be- 
yond the box size of the Millennium simulation — using 
the output o f the Mock Map Facility (MoMaF) code of 
IBlaizot et all (|2005h . while our mock galaxy catalogues are 
limited to the size of the simulation box. However, as shown 
in Sect. 12.81 working on a light cone or working on a single 
simulation box leads to similar numbers of mock CGs (we 
identify a factor of 1.2 fewer mpCGs, and 1.5 more mvCGs). 

• MEP consider CGs with a faint magnitude limit, while 
we also tie in a bright magnitude limit to ensure that all 
mock CGs were built from galaxies that spanned a range of 
over 3 magnitudes. 

• We have analysed the galaxies from 3 different SAMs, 
while MEP have only considered the DLB sample, which we 
found to produce the smallest fraction of physically dense 
mpCGs. 



• MEP only provide statistics for the mock CGs de- 
fined in projection (pmpCGs, which they refer to as 'HA's) 
but do not consider the subset of accordant-velocity groups 
(pmvCGs). We t hink this woul d have been worthwhile be- 
cause ever since iHickson et all (|l992T l published the HCG 
galaxy redshifts, most analyses have thrown out the discor- 
dant velocity HCGs. 

• MEP did not consider selection effects, while we con- 
sidered both the galaxy confusion from close, blended, pro- 
jected pairs, as well as the biases that we determined for the 
Hickson's visual selection of the HCGs. 

• MEP only considered those mpCGs with k ^ 3 galaxies 
that lie very close in real space, while we considered k ^ 4 
(to be consistent with Hickson's initial motivation to have 
at least 4 galaxies per HCG). 

• MEP define the Real mpCGs using a Friends-of-Friends 
linking length in real space, while we use a maximum 
real-space separation and the elongation along the line-of- 
sight. Structures built from small numbers of components 
with Frien ds-of-Friends algorithms tend to be more filamen- 
tary (e.g. iMoore. Frenk. fc Whit3ll993l ). For mpCGs that 
are CALGs or CAFs, the most distant outlier will deter- 
mine a similar maximum length and critical linking length. 
However, for mpCGs without outliers (e.g. Real mpCGs), 
the linking length will be smaller than the 3D length. In 
other words, selecting Real groups with a linking length 
of 200 ft -1 kpc will result in group 3D lengths considerably 
greater. Moreover, for mpCGs with both foreground and 
background galaxies, MEP's £ must be compared to our 
ftai/-maximum size s cut /2, and there is here a discrepancy of 
a factor two. Worse, for those (admittedly rare) cases of, say 
4, galaxies aligned along the line of sight at roughly equal 
separations just below £, one will end up with a group that 
spans up to 31 = 600 ft -1 kpc, which is now three times 
our maximum 3D length, but will still be called Compact 
Association (Real) by MEP, although it clearly is a chance 
alignment. In summary, this point and the previous one im- 
ply that MEP's criterion for calling an mpCG Real is much 
more liberal than ours. 



4.3 Perspectives 

In forthcoming papers, we will analyse the distribution of 
and correlations between the physical characteristics of the 
mvCGs, and show how they depend on their classification, 
in view of optimising the probability that a CG selected 
in redshift-space is physically dense. It would be worth- 
while to probe the formation of the physically dense CGs 
by analysing the merger trees of galaxies in the mock CGs. 
Finally, the analysis presented here will need to be confirmed 
with increasingly realistic simulations of galaxy catalogues, 
for example constructed from future galaxy formation mod- 
els run on the re cent high resolution Millen nium-II dark 
matter simulation jBovl an-Kolchin et al]|2009T l. and also on 
future high-resolution cosmological hydrodynamical simula- 
tions, with realistic prescriptions for feedback from AGN 
and supernovae. 
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APPENDIX A: FROM R-SDSS TO R-JOHNSON 
APPARENT MAGNITUDE LIMIT 



Accor ding to Table 3 of iFukugita. Shimasaku, fc Ichikawal 
(|l995h , r-R has typical values of 0.36 (E), 0.31 (SO), 0.33 
(Sab), 0.32 (Sbc), 0.30 (Scd), and 0.20 (Im), and we adopt 
(r — R) = 0.33. We thus restrict the HCG sample to a total 
extrapolated R band extinction-corrected magnitude 



R° T = B^~(B-R) iBO +(B T -B% 



1 - 



A R /A V 



< 17 AA , 
where we adopted Ab/A\ 



opted As/Ay = 1.33 and An/ Ay 
jCardelli et al.ll989l ). Equation (JXTJ assumes no B-R colour 



(Al) 



0.75 



gradient in the galaxy. 



APPENDIX B: GROUP BINDING ENERGIES 

The binding energy of the particles of a mock CG is difficult 
to determine because it is not clear which particles belong 
to the CG. 

One could alternatively use the galaxies instead of the 
particles. But most of the mass of a group is thought to lie 
in between the galaxies, so one must be careful on how the 
binding energy analysis is performed. We separate the group 
into the system of galaxies and the remaining intergalactic 
dark matter. The kinetic energy of the group would then be 

3 



K = K S + K d 



(M g + M d 



(Bl) 



where M s and Md are the masses of the galaxies and of the 
intergalactic dark matter, respectively, while a v is the one- 
dimensional velocity dispersion assumed to be the same for 
the intergalactic dark matter and the galaxies, and where we 
placed ourselves in the group centre-of-mass to get rid of the 
bulk kinetic energy. The potential energy is more difficult to 
handle as it is the sum of the potential energies of the galaxy 
system, the intergalactic dark matter system and the cross- 
term between galaxies and the intergalactic dark matter. 
Suppose a group has a factor fi times more intergalactic 
dark matter than mass in galaxies. From equation (JBTJ , the 
kinetic energy is then 



K = (/x + 1) K g 



(B2) 



If the galaxies and the dark matter have similar distributions 
in space (which is not really possible, since by definition the 
intergalactic dark matter is outside the galaxies), then the 
total potential energy can be written 



W 



VF g , g + W d ,d + W s ,d 



GMZ 



GMj 
ra 



GM s M d 



<•<:; 



( M 2 + fi + 1) W g , g , 



(B3) 



where tq is the gravitational radius, assumed to be the same 
for the three terms. So, if the group is in virial equilibrium, 
one has 2 K + W — 0, then according to equations (|B2|l and 
(IB3I). one finds that 



(B4) 



2 (/i + 1) K s + (m + /i + l) W g , s = 
Therefore, one finds that 
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2K S 



n + 1 



(B5) 



so, if the intergalactic dark matter makes up for say ^ = 4 
times as much as the galaxy mass, then if the group is in 
virial equilibrium, equation (|B5|I leads to — 2 K S:S /W gtg = 
21/5 = 4.2. Therefore, the virial ratio of the galaxies in a 
group can be far off from unity! It will depend on the fraction 
of mass in galaxies (i.e. on + 1), hence on jj,). Finally, 
since the virial ratio of the galaxy system appears to be 
greater than 2, we would incorrectly conclude that typical 
groups are unbound! 

One way to avoid these problems is to assign to each 
galaxy the fraction of the total group mass equal to the 
ratio of its mass divided by the total mass in galaxies for 
that group. In other words, we are putting the intergalactic 
dark matter mass in each galaxy in proportion to its mass. 
This is equivalent to /j, — 0, hence to a virial ratio of unity 
according to equation ()B5|I . 

The simulation data that we have at our disposal pro- 
vides the virial masses of the PG3D groups (dark matter 
included). If a mock CG is a CA, then we cannot know how 
much dark matter is assigned to this CA, but only to the 
PG3D group associated with it. We therefore choose to scale 
the galaxy masses to the PG3D group mass, i.e. 



MpG3D 



(B6) 



£PG3D 



Still, there remains the issue of CG galaxies that do not be- 
long to any PG3D. One possibility is to apply equation (|B6[) 
to the CG galaxies that lie within PG3D groups, without 
scaling the masses of the isolated galaxies. The alternative 
is to scale by the fraction of mass in the whole simulation 
box, instead of the PG3D group. 

Another issue is that the virial spheres around the 
galaxies will tend to overlap inside the Real CGs. One there- 
fore needs to soften the galaxy-galaxy potential e nergy of 
inter action, for example with the approximation (|Mamonl 
1 19871 ): 



Vint {n,j) = 



where rj 



Gm'm' 



r" ■ + r • 



(B7) 



V 

is the root mean squared of the half-mass radii 



of the pair of galaxies {i, j}. The virial theorem becomes 



where the Clausius virial of a group is 



(B8) 



3r> 



r" ■ + r- ■ 

% : j ' ij,rms 



3/2 



= G S m'jm'j ■ 



, . _ j fy 1 ^ 

, i,J ij,rms 



3/2 



(B9) 



where r^j is the vector separating galaxies i and j. 

But how do we estimate the galaxy half-mass radii? We 
can compute analytically the half-mass radius of the mat- 
ter within the virial radius, say for an NFW model, with a 
concentration c = r v /r_2 = 10 (where r_ 2 is the 'scale' ra- 
dius of slope —2), for which r a /r v ~ 0.36 (|Lokas fc Mamonl 
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Figure Bl. Virial ratio as a function of virial mass for PG3D 
groups with N > 10 members extracted from DLB model, where 
the galaxy mass correction (eq. IB6H is over the entire box (top 
panel) or the parent group (bottom panel). The solid red line 
shows the expected virial ratio of unity, while the dashed line 
shows the limit for unbound groups. 



l200ll . Fig. 4 and eq. [28]). But we could also compute the 
half-mass radius within a larger radius, say the turnaround 
radius beyond which the Universe is expanding, and which 
is typically 3.5 times the virial r adius. Assuming t hat the 
NFW model extends that far (see lPrada et al.ll2006h . going 
to the turnaround radius amounts to increasing the concen- 
tration by a factor of rt a /V v ~ 3.5. So, if c = 10 for a galaxy, 
at the turnaround radius, we would use c = 35 and find 
r h /r ta ~ 0.23, i.e. r h /r v ~ 0.79. 

But then, how do we estimate the mass within the 
virial radius of the galaxy? We could guess a mass- 
to-light ratio M(r\)/L B = 100, although M(r v )/L B 
is thought to decrease with i ncreasing l umino sity to 
reach a minimum around 70 (|Eke et al.l 120061 ). Then 
r v = [2/A(GM/Hi)] 1/3 = [2/A {G(M/L)L/H 2 )} 1/3 = 
544(L/10 11 ) 1/3 kpc for A = 100 (as we used in the paper), 
Hq — 73kms~ 1 Mpc -1 (as used in the Millennium Simula- 
tion) and M/L = 100. For L» = 0.18 x 10 11 L Q , we end up 
with r v = 307 kpc. So for L* galaxies, we need a softening 
of typically r = 70 to 250 kpc, i.e. c = r /L 1/a = 0.03 
to 0.10. Of course, the higher the softening scale, the less 
negative is the potential energy and the less bound is the 
system. 

We test our prescription by computing the virial ratios 
of the PG3Ds using both PG3D scaling and box scaling of 
the galaxy masses, with different values for the softening 
scale Co- The correct scaling must lead to virial ratios of 
unity, independent of group mass. 

Figure lBll shows the results of our test on PG3D groups. 
For both normalisations, we find that the softening Co = 
0.03 kpcZ/g 1 ^ 3 (corresponding to ro = 70 hT 1 kpc for L — L, 
galaxies) bring the virial ratios of the highest mass PG3Ds 
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to unity. Without the softening, the potential energies are 
overestimated (in absolute value), hence the virial ratios are 
underestimated, while with too strong softening the virial 
ratios are overestimated. 

However, Figure [BT1 indicates that even with the correct 
softening, i.e. with correct virial ratios at the high mass end, 
there is so much scatter in the virial ratios at low masses, 
that over 13% of the PG3Ds are found to be unbound for 
PG3D virial masses below4xlO 13 M . This means that for 
the typical masses of the CGs, our virial ratio estimator is 
too inaccurate to use as a CG classifier. 



